Content Classification of Multimedia Documents using Partitions of Low-Level Features

نویسندگان

  • Edda Leopold
  • Jörg Kindermann
چکیده

Audio-visual documents obtained from German TV news are classified according to the IPTC topic categorization scheme. To this end usual text classification techniques are adapted to speech, video, and non-speech audio. For each of the three modalities word analogues are generated: sequences of syllables for speech, “video words” based on low level color features (color moments, color correlogram and color wavelet), and “audio words” based on low-level spectral features (spectral envelope and spectral flatness) for non-speech audio. Such audio and video words provide a means to represent the different modalities in a uniform way. The frequencies of the word analogues represent audio-visual documents: the standard bagDigital Peer Publishing Licence Any party may pass on this Work by electronic means and make it available for download under the terms and conditions of the current version of the Digital Peer Publishing Licence (DPPL). The text of the licence may be accessed and retrieved via Internet at http://www.dipp.nrw.de/. First presented at the International Conference on Content-Based Multimedia Indexing 2003, extended and revised for JVRB of-words approach. Support vector machines are used for supervised classification in a 1 vs. n setting. Classification based on speech outperforms all other single modalities. Combining speech with non-speech audio improves classification. Classification is further improved by supplementing speech and non-speech audio with video words. Optimal F-scores range between 62% and 94% corresponding to 50% 84% above chance. The optimal combination of modalities depends on the category to be recognized. The construction of audio and video words from low-level features provide a good basis for the integration of speech, nonspeech audio and video.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Collection Information To Improve Low Level Feature Based Multimedia Retrieval

We propose a statistical representation for media documents called feature terms. This approach normalises feature distributions in a collection and leads to a mixed query/document model for multimedia retrieval. Two related problems are addressed: (1) how to extract discrete feature terms from continuous low level features; (2) how to rank the relevance between documents. TRECVid 2006 video co...

متن کامل

Region-based Image Classification

Image classification using low-level features is always a challenging research in computer vision. Recent years, content-based image retrieval has emerged as an important area in computer vision and multimedia computing. In this project, I'm going to introduce and implement an approach [1] that can better represent the image using low-level features and then I apply this method in image classif...

متن کامل

A Unified Approach to Indexing Multimedia on the Web

Indexing multimedia Web documents can be regarded as an important part of Web engineering, a concept first proposed [19] by one of the authors and his collaborators in 1998 at the World Wide Web WWW7 conference in Brisbane, Australia. Contentbased indexing of multimedia has always been a challenging task. The enormity and diversity of the multimedia content on the World Wide Web (WWW) adds anot...

متن کامل

Are we Ready to Embrace the Semantic Web?

The aim of the semantic web is to describe resources using metadata elements that can be processed or interpreted by machines. MPEG-7 [1] is the result of a standardisation effort to annotate multimedia documents. It offers a rich suite of metadata descriptors for describing these documents at various levels of abstraction from low level features to high level semantics. Owing to the proliferat...

متن کامل

An Improvement in Support Vector Machines Algorithm with Imperialism Competitive Algorithm for Text Documents Classification

Due to the exponential growth of electronic texts, their organization and management requires a tool to provide information and data in search of users in the shortest possible time. Thus, classification methods have become very important in recent years. In natural language processing and especially text processing, one of the most basic tasks is automatic text classification. Moreover, text ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JVRB

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2006